Spectral Methods for Thesaurus Construction

Authors

  • Nobuyuki Shimizu
  • Masashi Sugiyama
  • Hiroshi Nakagawa
Abstract

Traditionally, popular synonym acquisition methods are based on the distributional hypothesis: a metric such as the Jaccard coefficient is used to evaluate the similarity between the contexts of words, and the most similar words are returned as synonyms for a query. When compiling and cleaning a thesaurus, however, one often already has a modest number of synonym relations at hand. Could something be done with a half-built thesaurus alone? We propose the use of spectral methods and discuss their relation to other network-based algorithms in natural language processing (NLP), such as PageRank and bootstrapping. Since compiling a thesaurus is very laborious, we believe that adding the proposed method to the toolkit of thesaurus constructors would significantly ease the task.
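
As a rough illustration of the distributional baseline the abstract mentions (not the spectral method the paper proposes), the sketch below scores candidate synonyms by the Jaccard coefficient between sets of context words. The toy vocabulary, the `contexts` dictionary, and the helper names `jaccard` and `rank_candidates` are all hypothetical and chosen only for this example.

```python
def jaccard(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B| of two collections, treated as sets."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Two empty context sets: define the similarity as 0 by convention.
        return 0.0
    return len(a & b) / len(union)


# Hypothetical toy data: each word maps to the set of words seen in its contexts.
contexts = {
    "car": {"drive", "road", "engine", "park"},
    "automobile": {"drive", "road", "engine", "dealer"},
    "banana": {"eat", "peel", "fruit"},
}


def rank_candidates(query, contexts):
    """Rank all other words by context similarity to the query word."""
    q = contexts[query]
    scores = [(w, jaccard(q, c)) for w, c in contexts.items() if w != query]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for word, score in rank_candidates("car", contexts):
        print(word, round(score, 2))
```

Under these assumed contexts, "automobile" shares three of five distinct context words with "car" and so ranks above "banana", which shares none.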

Similar resources

Problems of Thesaurus Construction in Iran from the Viewpoint of Thesaurus Makers

Introduction: The present research studies the theoretical foundations of thesaurus construction before and after the internet and identifies the problems of thesaurus construction in Iran from the point of view of thesaurus makers and translators of the published thesauri. Methods: The research population was 6 thesaurus makers (AbdolHossein Azaragn, Abbas Hori, Fatemeh Rahadoost, Faribor...

Exploration and Study of Chinese Thesaurus Automation Construction for Digital Libraries

The paper explores automated construction of a Chinese thesaurus based on freely available digital library resources. The key methods and study results are presented in the paper. The study adopted natural language processing techniques to analyze the linguistic characteristics of terms, combined with statistical analysis to extract the terms from technical literature. Our ...

Research on Construction Method of Agricultural Domain Ontology

Building on the two major methods for constructing a domain ontology, namely ontology engineering and thesaurus-based ontology construction, this paper puts forward a thesaurus-based methodology for constructing an agricultural domain ontology. The paper details all parts of the methodology. Under the guidance of this methodology, we build an agricultural domain ontology.

A New Dictionary Construction Method in Sparse Representation Techniques for Target Detection in Hyperspectral Imagery

Hyperspectral data in remote sensing, gathered at fine spectral resolution (about 10 nanometers), contain a plethora of spectral bands (roughly 200 bands). Since precious information about the spectral features of target materials can be extracted from these data, they have been used extensively in hyperspectral target detection. One of the problems associated with the detect...

Improving Context Vector Models by Feature Clustering for Automatic Thesaurus Construction

Thesauruses are useful resources for NLP; however, manual construction of a thesaurus is time-consuming and suffers from low coverage. Automatic thesaurus construction was developed to solve this problem. The conventional way to construct a thesaurus automatically is to find similar words based on context vector models and then organize the similar words into a thesaurus structure. But the context vector metho...

Journal title:
  • IEICE Transactions

Volume 93-D  Issue 

Pages  -

Publication date 2010